Overview
Dataset statistics
| Statistic | Value |
|---|---|
| Number of variables | 32 |
| Number of observations | 3000000 |
| Missing cells | 16062854 |
| Missing cells (%) | 16.7% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 2.1 GiB |
| Average record size in memory | 760.1 B |
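The headline figures in the dataset statistics can be cross-checked with a few lines of arithmetic. The sketch below assumes that the missing-cell percentage is taken over all cells (observations × variables) and that GiB means 2^30 bytes; both are assumptions about how the report derives its numbers, not something it states.

```python
# Figures copied from the dataset statistics table.
n_variables = 32
n_observations = 3_000_000
missing_cells = 16_062_854
avg_record_bytes = 760.1

# Assumption: the percentage is computed over every cell in the table.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

# Assumption: 1 GiB = 2**30 bytes.
total_gib = avg_record_bytes * n_observations / 2**30

print(f"missing cells: {missing_pct:.1f}%")
print(f"estimated size in memory: {total_gib:.1f} GiB")
```

Under these assumptions both values round to the figures reported in the table (16.7% and 2.1 GiB).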
Variable types
| Type | Count |
|---|---|
| DateTime | 1 |
| Categorical | 6 |
| Numeric | 21 |
| Text | 4 |
Alerts
| Alert | Type |
|---|---|
| AIRLINE is highly overall correlated with AIRLINE_CODE and 2 other fields | High correlation |
| AIRLINE_CODE is highly overall correlated with AIRLINE and 2 other fields | High correlation |
| AIRLINE_DOT is highly overall correlated with AIRLINE and 2 other fields | High correlation |
| AIR_TIME is highly overall correlated with CANCELLED and 4 other fields | High correlation |
| ARR_DELAY is highly overall correlated with CANCELLED and 2 other fields | High correlation |
| ARR_TIME is highly overall correlated with CANCELLED and 5 other fields | High correlation |
| CANCELLATION_CODE is highly overall correlated with CANCELLED and 1 other field | High correlation |
| CANCELLED is highly overall correlated with AIR_TIME and 11 other fields | High correlation |
| CRS_ARR_TIME is highly overall correlated with ARR_TIME and 4 other fields | High correlation |
| CRS_DEP_TIME is highly overall correlated with ARR_TIME and 4 other fields | High correlation |
| CRS_ELAPSED_TIME is highly overall correlated with AIR_TIME and 2 other fields | High correlation |
| DELAY_DUE_CARRIER is highly overall correlated with CANCELLED and 1 other field | High correlation |
| DELAY_DUE_LATE_AIRCRAFT is highly overall correlated with CANCELLED and 1 other field | High correlation |
| DELAY_DUE_NAS is highly overall correlated with CANCELLED and 1 other field | High correlation |
| DELAY_DUE_SECURITY is highly overall correlated with CANCELLED and 1 other field | High correlation |
| DELAY_DUE_WEATHER is highly overall correlated with CANCELLED and 1 other field | High correlation |
| DEP_DELAY is highly overall correlated with ARR_DELAY | High correlation |
| DEP_TIME is highly overall correlated with ARR_TIME and 4 other fields | High correlation |
| DISTANCE is highly overall correlated with AIR_TIME and 2 other fields | High correlation |
| DIVERTED is highly overall correlated with AIR_TIME and 8 other fields | High correlation |
| DOT_CODE is highly overall correlated with AIRLINE and 2 other fields | High correlation |
| ELAPSED_TIME is highly overall correlated with AIR_TIME and 4 other fields | High correlation |
| TAXI_IN is highly overall correlated with CANCELLED | High correlation |
| WHEELS_OFF is highly overall correlated with ARR_TIME and 4 other fields | High correlation |
| WHEELS_ON is highly overall correlated with ARR_TIME and 5 other fields | High correlation |
| CANCELLED is highly imbalanced (82.4%) | Imbalance |
| DIVERTED is highly imbalanced (97.6%) | Imbalance |
| DEP_TIME has 77615 (2.6%) missing values | Missing |
| DEP_DELAY has 77644 (2.6%) missing values | Missing |
| TAXI_OUT has 78806 (2.6%) missing values | Missing |
| WHEELS_OFF has 78806 (2.6%) missing values | Missing |
| WHEELS_ON has 79944 (2.7%) missing values | Missing |
| TAXI_IN has 79944 (2.7%) missing values | Missing |
| ARR_TIME has 79942 (2.7%) missing values | Missing |
| ARR_DELAY has 86198 (2.9%) missing values | Missing |
| CANCELLATION_CODE has 2920860 (97.4%) missing values | Missing |
| ELAPSED_TIME has 86198 (2.9%) missing values | Missing |
| AIR_TIME has 86198 (2.9%) missing values | Missing |
| DELAY_DUE_CARRIER has 2466137 (82.2%) missing values | Missing |
| DELAY_DUE_WEATHER has 2466137 (82.2%) missing values | Missing |
| DELAY_DUE_NAS has 2466137 (82.2%) missing values | Missing |
| DELAY_DUE_SECURITY has 2466137 (82.2%) missing values | Missing |
| DELAY_DUE_LATE_AIRCRAFT has 2466137 (82.2%) missing values | Missing |
| DELAY_DUE_SECURITY is highly skewed (γ1 = 101.2175459) | Skewed |
| DEP_DELAY has 141944 (4.7%) zeros | Zeros |
| ARR_DELAY has 53685 (1.8%) zeros | Zeros |
| DELAY_DUE_CARRIER has 236912 (7.9%) zeros | Zeros |
| DELAY_DUE_WEATHER has 502435 (16.7%) zeros | Zeros |
| DELAY_DUE_NAS has 277386 (9.2%) zeros | Zeros |
| DELAY_DUE_SECURITY has 531104 (17.7%) zeros | Zeros |
| DELAY_DUE_LATE_AIRCRAFT has 274849 (9.2%) zeros | Zeros |
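The 82.4% imbalance figure for CANCELLED can be reproduced under two assumptions that the report itself does not state: that the score is one minus the normalized Shannon entropy of the class shares, and that the number of cancelled flights equals the number of rows with a non-missing CANCELLATION_CODE (3000000 − 2920860 = 79140). The function name below is illustrative, and treating a single-class column as score 0 is a choice made for this sketch.

```python
import math

def imbalance_score(counts):
    """Return 1 - H / log2(k), with H the Shannon entropy (in bits) of the class shares."""
    k = len(counts)
    if k < 2:
        # A single class has no defined normalized entropy; 0.0 is a choice for this sketch.
        return 0.0
    total = sum(counts)
    shares = [c / total for c in counts if c > 0]
    entropy = -sum(p * math.log2(p) for p in shares)
    return 1 - entropy / math.log2(k)

n_rows = 3_000_000
# Assumption: cancelled flights are exactly the rows with a CANCELLATION_CODE.
cancelled = n_rows - 2_920_860
score = imbalance_score([n_rows - cancelled, cancelled])
print(f"CANCELLED imbalance: {score:.1%}")
```

With these inputs the score comes out at roughly 0.824, which agrees with the alert. A perfectly balanced two-class column would score 0 under the same formula.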
Reproduction
| Item | Value |
|---|---|
| Analysis started | 2025-12-11 05:13:15.263943 |
| Analysis finished | 2025-12-11 05:19:33.564811 |
| Duration | 6 minutes and 18.3 seconds |
| Software version | ydata-profiling v4.18.0 |
| Download configuration | config.json |
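The reported duration follows directly from the two timestamps. A minimal check using only the standard library, with the timestamp format string inferred from how the values are printed in the table:

```python
from datetime import datetime

# Format inferred from the timestamps as printed in the Reproduction table.
FMT = "%Y-%m-%d %H:%M:%S.%f"
started = datetime.strptime("2025-12-11 05:13:15.263943", FMT)
finished = datetime.strptime("2025-12-11 05:19:33.564811", FMT)

elapsed = (finished - started).total_seconds()
minutes, seconds = divmod(elapsed, 60)
print(f"{int(minutes)} minutes and {seconds:.1f} seconds")
```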
Variables
FL_DATE
Date
| Distinct | 1704 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 22.9 MiB |
| Minimum | 2019-01-01 00:00:00 |
| Maximum | 2023-08-31 00:00:00 |
| Invalid dates | 0 |
| Invalid dates (%) | 0.0% |
AIRLINE
Categorical
High correlation
| Distinct | 18 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 219.1 MiB |
Length
| Max length | 34 |
|---|---|
| Median length | 22 |
| Mean length | 19.593653 |
| Min length | 9 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | United Air Lines Inc. |
|---|---|
| 2nd row | Delta Air Lines Inc. |
| 3rd row | United Air Lines Inc. |
| 4th row | Delta Air Lines Inc. |
| 5th row | Spirit Air Lines |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| Southwest Airlines Co. | 576470 | 19.2% |
| Delta Air Lines Inc. | 395239 | 13.2% |
| American Airlines Inc. | 383106 | 12.8% |
| SkyWest Airlines Inc. | 343737 | 11.5% |
| United Air Lines Inc. | 254504 | 8.5% |
| Republic Airline | 143107 | 4.8% |
| Envoy Air | 121256 | 4.0% |
| JetBlue Airways | 112844 | 3.8% |
| Endeavor Air Inc. | 112463 | 3.7% |
| PSA Airlines Inc. | 107050 | 3.6% |
| Other values (8) | 450224 | 15.0% |
Words
| Value | Count | Frequency (%) |
|---|---|---|
| inc | 1858158 | 20.1% |
| airlines | 1691504 | 18.3% |
| air | 1052545 | 11.4% |
| lines | 745454 | 8.1% |
| southwest | 576470 | 6.2% |
| co | 576470 | 6.2% |
| delta | 395239 | 4.3% |
| american | 383106 | 4.1% |
| skywest | 343737 | 3.7% |
| united | 254504 | 2.8% |
| Other values (18) | 1360141 | 14.7% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| i | 6754270 | 11.5% |
| (space) | 6237328 | 10.6% |
| n | 5479504 | 9.3% |
| e | 5234759 | 8.9% |
| r | 3759928 | 6.4% |
| s | 3673652 | 6.2% |
| A | 3643361 | 6.2% |
| l | 2691744 | 4.6% |
| t | 2491261 | 4.2% |
| . | 2434628 | 4.1% |
| Other values (33) | 16380523 | 27.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 58780958 |
AIRLINE_DOT
Categorical
High correlation
| Distinct | 18 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 230.6 MiB |
Length
| Max length | 38 |
|---|---|
| Median length | 26 |
| Mean length | 23.593653 |
| Min length | 13 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | United Air Lines Inc.: UA |
|---|---|
| 2nd row | Delta Air Lines Inc.: DL |
| 3rd row | United Air Lines Inc.: UA |
| 4th row | Delta Air Lines Inc.: DL |
| 5th row | Spirit Air Lines: NK |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| Southwest Airlines Co.: WN | 576470 | 19.2% |
| Delta Air Lines Inc.: DL | 395239 | 13.2% |
| American Airlines Inc.: AA | 383106 | 12.8% |
| SkyWest Airlines Inc.: OO | 343737 | 11.5% |
| United Air Lines Inc.: UA | 254504 | 8.5% |
| Republic Airline: YX | 143107 | 4.8% |
| Envoy Air: MQ | 121256 | 4.0% |
| JetBlue Airways: B6 | 112844 | 3.8% |
| Endeavor Air Inc.: 9E | 112463 | 3.7% |
| PSA Airlines Inc.: OH | 107050 | 3.6% |
| Other values (8) | 450224 | 15.0% |
Words
| Value | Count | Frequency (%) |
|---|---|---|
| inc | 1858158 | 15.2% |
| airlines | 1691504 | 13.8% |
| air | 1052545 | 8.6% |
| lines | 745454 | 6.1% |
| southwest | 576470 | 4.7% |
| wn | 576470 | 4.7% |
| co | 576470 | 4.7% |
| delta | 395239 | 3.2% |
| dl | 395239 | 3.2% |
| american | 383106 | 3.1% |
| Other values (36) | 3986673 | 32.6% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| (space) | 9237328 | 13.1% |
| i | 6754270 | 9.5% |
| n | 5479504 | 7.7% |
| e | 5234759 | 7.4% |
| A | 4796658 | 6.8% |
| r | 3759928 | 5.3% |
| s | 3673652 | 5.2% |
| : | 3000000 | 4.2% |
| l | 2691744 | 3.8% |
| t | 2491261 | 3.5% |
| Other values (45) | 23661854 | 33.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 70780958 |
AIRLINE_CODE
Categorical
High correlation
| Distinct | 18 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 168.8 MiB |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | UA |
|---|---|
| 2nd row | DL |
| 3rd row | UA |
| 4th row | DL |
| 5th row | NK |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| WN | 576470 | 19.2% |
| DL | 395239 | 13.2% |
| AA | 383106 | 12.8% |
| OO | 343737 | 11.5% |
| UA | 254504 | 8.5% |
| YX | 143107 | 4.8% |
| MQ | 121256 | 4.0% |
| B6 | 112844 | 3.8% |
| 9E | 112463 | 3.7% |
| OH | 107050 | 3.6% |
| Other values (8) | 450224 | 15.0% |
Words
| Value | Count | Frequency (%) |
|---|---|---|
| wn | 576470 | 19.2% |
| dl | 395239 | 13.2% |
| aa | 383106 | 12.8% |
| oo | 343737 | 11.5% |
| ua | 254504 | 8.5% |
| yx | 143107 | 4.8% |
| mq | 121256 | 4.0% |
| b6 | 112844 | 3.8% |
| 9e | 112463 | 3.7% |
| oh | 107050 | 3.6% |
| Other values (8) | 450224 | 15.0% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| A | 1153297 | 19.2% |
| O | 794524 | 13.2% |
| N | 672181 | 11.2% |
| W | 576470 | 9.6% |
| D | 395239 | 6.6% |
| L | 395239 | 6.6% |
| U | 254504 | 4.2% |
| Y | 208119 | 3.5% |
| 9 | 176929 | 2.9% |
| X | 163741 | 2.7% |
| Other values (12) | 1209757 | 20.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 6000000 |
DOT_CODE
Real number (ℝ)
High correlation
| Distinct | 18 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 19976.294 |
| Minimum | 19393 |
| Maximum | 20452 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 22.9 MiB |
Quantile statistics
| Minimum | 19393 |
|---|---|
| 5-th percentile | 19393 |
| Q1 | 19790 |
| median | 19930 |
| Q3 | 20368 |
| 95-th percentile | 20436 |
| Maximum | 20452 |
| Range | 1059 |
| Interquartile range (IQR) | 578 |
Descriptive statistics
| Standard deviation | 377.28462 |
|---|---|
| Coefficient of variation (CV) | 0.018886617 |
| Kurtosis | -1.3107567 |
| Mean | 19976.294 |
| Median Absolute Deviation (MAD) | 374 |
| Skewness | -0.22978821 |
| Sum | 5.9928882 × 10¹⁰ |
| Variance | 142343.68 |
| Monotonicity | Not monotonic |
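Several of the descriptive statistics for DOT_CODE are derived from one another, which gives a quick consistency check on the table. The sketch below uses the standard definitions (coefficient of variation = standard deviation / mean, variance = standard deviation squared, sum = mean × count); that the report uses exactly these definitions is an assumption, although the rounded figures agree.

```python
# Figures copied from the DOT_CODE tables; the column has no missing values.
n = 3_000_000
mean = 19976.294
std = 377.28462

cv = std / mean        # coefficient of variation
variance = std ** 2    # variance as the square of the standard deviation
total = mean * n       # sum recovered from the mean and the row count

print(f"CV       = {cv:.9f}")
print(f"Variance = {variance:.2f}")
print(f"Sum      = {total:.7e}")
```

The results match the reported CV (0.018886617), variance (142343.68) and sum (5.9928882 × 10^10) to within the rounding of the inputs.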
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 19393 | 576470 | 19.2% |
| 19790 | 395239 | 13.2% |
| 19805 | 383106 | 12.8% |
| 20304 | 343737 | 11.5% |
| 19977 | 254504 | 8.5% |
| 20452 | 143107 | 4.8% |
| 20398 | 121256 | 4.0% |
| 20409 | 112844 | 3.8% |
| 20363 | 112463 | 3.7% |
| 20397 | 107050 | 3.6% |
| Other values (8) | 450224 | 15.0% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 19393 | 576470 | 19.2% |
| 19687 | 20634 | 0.7% |
| 19690 | 32114 | 1.1% |
| 19790 | 395239 | 13.2% |
| 19805 | 383106 | 12.8% |
| 19930 | 100467 | 3.3% |
| 19977 | 254504 | 8.5% |
| 20304 | 343737 | 11.5% |
| 20363 | 112463 | 3.7% |
| 20366 | 19082 | 0.6% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 20452 | 143107 | 4.8% |
| 20436 | 64466 | 2.1% |
| 20416 | 95711 | 3.2% |
| 20409 | 112844 | 3.8% |
| 20398 | 121256 | 4.0% |
| 20397 | 107050 | 3.6% |
| 20378 | 65012 | 2.2% |
| 20368 | 52738 | 1.8% |
| 20366 | 19082 | 0.6% |
| 20363 | 112463 | 3.7% |
FL_NUMBER
Real number (ℝ)
| Distinct | 7111 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2511.5355 |
| Minimum | 1 |
| Maximum | 9562 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 22.9 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 261 |
| Q1 | 1051 |
| median | 2152 |
| Q3 | 3797 |
| 95-th percentile | 5656 |
| Maximum | 9562 |
| Range | 9561 |
| Interquartile range (IQR) | 2746 |
Descriptive statistics
| Standard deviation | 1747.258 |
|---|---|
| Coefficient of variation (CV) | 0.69569314 |
| Kurtosis | -0.89109021 |
| Mean | 2511.5355 |
| Median Absolute Deviation (MAD) | 1334 |
| Skewness | 0.50763183 |
| Sum | 7.5346066 × 10⁹ |
| Variance | 3052910.7 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 334 | 1221 | < 0.1% |
| 64 | 1216 | < 0.1% |
| 403 | 1196 | < 0.1% |
| 706 | 1191 | < 0.1% |
| 371 | 1185 | < 0.1% |
| 352 | 1169 | < 0.1% |
| 573 | 1147 | < 0.1% |
| 539 | 1145 | < 0.1% |
| 676 | 1138 | < 0.1% |
| 358 | 1138 | < 0.1% |
| Other values (7101) | 2988254 |
| Value | Count | Frequency (%) |
| 1 | 931 | |
| 2 | 886 | |
| 3 | 866 | |
| 4 | 843 | |
| 5 | 663 | |
| 6 | 856 | |
| 7 | 792 | |
| 8 | 753 | |
| 9 | 720 | |
| 10 | 655 |
| Value | Count | Frequency (%) |
| 9562 | 1 | < 0.1% |
| 8819 | 4 | |
| 8818 | 1 | < 0.1% |
| 8816 | 2 | |
| 8815 | 2 | |
| 8814 | 2 | |
| 8812 | 2 | |
| 8811 | 3 | |
| 8810 | 3 | |
| 8809 | 1 | < 0.1% |
ORIGIN
Text
| Distinct | 380 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 171.7 MiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | FLL |
|---|---|
| 2nd row | MSP |
| 3rd row | DEN |
| 4th row | MSP |
| 5th row | MCO |
| Value | Count | Frequency (%) |
| atl | 153556 | 5.1% |
| dfw | 130334 | 4.3% |
| ord | 122296 | 4.1% |
| den | 119919 | 4.0% |
| clt | 94304 | 3.1% |
| lax | 85872 | 2.9% |
| phx | 74815 | 2.5% |
| las | 73470 | 2.4% |
| sea | 70906 | 2.4% |
| mco | 63883 | 2.1% |
| Other values (370) | 2010645 |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 1007533 | 11.2% |
| L | 834835 | 9.3% |
| S | 755056 | 8.4% |
| D | 715438 | 7.9% |
| T | 500379 | 5.6% |
| O | 457462 | 5.1% |
| C | 456042 | 5.1% |
| M | 401890 | 4.5% |
| F | 377357 | 4.2% |
| W | 359441 | 4.0% |
| Other values (16) | 3134567 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 9000000 |
ORIGIN_CITY
Text
| Distinct | 373 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 200.6 MiB |
Length
| Max length | 34 |
|---|---|
| Median length | 29 |
| Mean length | 13.114709 |
| Min length | 8 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Fort Lauderdale, FL |
|---|---|
| 2nd row | Minneapolis, MN |
| 3rd row | Denver, CO |
| 4th row | Minneapolis, MN |
| 5th row | Orlando, FL |
| Value | Count | Frequency (%) |
| tx | 326015 | 4.7% |
| ca | 316863 | 4.5% |
| fl | 259713 | 3.7% |
| ga | 165211 | 2.4% |
| il | 164665 | 2.4% |
| chicago | 157368 | 2.3% |
| atlanta | 153556 | 2.2% |
| san | 150465 | 2.2% |
| ny | 146260 | 2.1% |
| new | 135441 | 1.9% |
| Other values (449) | 5007918 |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 3983475 | 10.1% |
| a | 3005775 | 7.6% |
| , | 3000000 | 7.6% |
| o | 2192431 | 5.6% |
| e | 2072173 | 5.3% |
| t | 1942800 | 4.9% |
| n | 1915018 | 4.9% |
| l | 1753494 | 4.5% |
| i | 1507440 | 3.8% |
| r | 1430006 | 3.6% |
| Other values (48) | 16541514 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 39344126 |
DEST
Text
| Distinct | 380 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 171.7 MiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | EWR |
|---|---|
| 2nd row | SEA |
| 3rd row | MSP |
| 4th row | SFO |
| 5th row | DFW |
| Value | Count | Frequency (%) |
| atl | 153569 | 5.1% |
| dfw | 129770 | 4.3% |
| ord | 123334 | 4.1% |
| den | 119592 | 4.0% |
| clt | 95413 | 3.2% |
| lax | 85621 | 2.9% |
| phx | 75605 | 2.5% |
| las | 73462 | 2.4% |
| sea | 70832 | 2.4% |
| mco | 63818 | 2.1% |
| Other values (370) | 2008984 |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 1007429 | 11.2% |
| L | 834687 | 9.3% |
| S | 755323 | 8.4% |
| D | 714881 | 7.9% |
| T | 502444 | 5.6% |
| C | 458231 | 5.1% |
| O | 457402 | 5.1% |
| M | 400386 | 4.4% |
| F | 375758 | 4.2% |
| W | 358043 | 4.0% |
| Other values (16) | 3135416 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 9000000 |
DEST_CITY
Text
| Distinct | 373 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 200.6 MiB |
Length
| Max length | 34 |
|---|---|
| Median length | 29 |
| Mean length | 13.115468 |
| Min length | 8 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Newark, NJ |
|---|---|
| 2nd row | Seattle, WA |
| 3rd row | Minneapolis, MN |
| 4th row | San Francisco, CA |
| 5th row | Dallas/Fort Worth, TX |
| Value | Count | Frequency (%) |
| tx | 325301 | 4.7% |
| ca | 316469 | 4.5% |
| fl | 260353 | 3.7% |
| il | 165395 | 2.4% |
| ga | 164952 | 2.4% |
| chicago | 158087 | 2.3% |
| atlanta | 153569 | 2.2% |
| san | 150105 | 2.2% |
| ny | 145339 | 2.1% |
| nc | 135112 | 1.9% |
| Other values (449) | 5006707 |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 3981389 | 10.1% |
| a | 3007585 | 7.6% |
| , | 3000000 | 7.6% |
| o | 2192461 | 5.6% |
| e | 2070816 | 5.3% |
| t | 1944131 | 4.9% |
| n | 1915428 | 4.9% |
| l | 1754841 | 4.5% |
| i | 1509323 | 3.8% |
| r | 1427774 | 3.6% |
| Other values (48) | 16542655 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 39346403 |
CRS_DEP_TIME
Real number (ℝ)
High correlation
| Distinct | 1384 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1327.062 |
| Minimum | 1 |
| Maximum | 2359 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 22.9 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 605 |
| Q1 | 915 |
| median | 1320 |
| Q3 | 1730 |
| 95-th percentile | 2120 |
| Maximum | 2359 |
| Range | 2358 |
| Interquartile range (IQR) | 815 |
Descriptive statistics
| Standard deviation | 485.87885 |
|---|---|
| Coefficient of variation (CV) | 0.36613124 |
| Kurtosis | -1.0361419 |
| Mean | 1327.062 |
| Median Absolute Deviation (MAD) | 410 |
| Skewness | 0.08706432 |
| Sum | 3.981186 × 10⁹ |
| Variance | 236078.26 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 600 | 63421 | 2.1% |
| 700 | 50527 | 1.7% |
| 800 | 31344 | 1.0% |
| 900 | 21357 | 0.7% |
| 830 | 20447 | 0.7% |
| 630 | 19645 | 0.7% |
| 730 | 19289 | 0.6% |
| 1000 | 19200 | 0.6% |
| 1100 | 18295 | 0.6% |
| 1700 | 15904 | 0.5% |
| Other values (1374) | 2720571 |
| Value | Count | Frequency (%) |
| 1 | 24 | < 0.1% |
| 2 | 15 | < 0.1% |
| 3 | 1 | < 0.1% |
| 4 | 17 | < 0.1% |
| 5 | 136 | |
| 6 | 36 | < 0.1% |
| 7 | 13 | < 0.1% |
| 8 | 16 | < 0.1% |
| 9 | 60 | |
| 10 | 99 |
| Value | Count | Frequency (%) |
| 2359 | 3182 | |
| 2358 | 210 | < 0.1% |
| 2357 | 187 | < 0.1% |
| 2356 | 207 | < 0.1% |
| 2355 | 1600 | |
| 2354 | 154 | < 0.1% |
| 2353 | 121 | < 0.1% |
| 2352 | 148 | < 0.1% |
| 2351 | 82 | < 0.1% |
| 2350 | 845 | < 0.1% |
DEP_TIME
Real number (ℝ)
High correlation Missing
| Distinct | 1440 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 77615 |
| Missing (%) | 2.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1329.7759 |
| Minimum | 1 |
| Maximum | 2400 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 22.9 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 602 |
| Q1 | 916 |
| median | 1323 |
| Q3 | 1739 |
| 95-th percentile | 2132 |
| Maximum | 2400 |
| Range | 2399 |
| Interquartile range (IQR) | 823 |
Descriptive statistics
| Standard deviation | 499.31005 |
|---|---|
| Coefficient of variation (CV) | 0.37548436 |
| Kurtosis | -0.96882134 |
| Mean | 1329.7759 |
| Median Absolute Deviation (MAD) | 412 |
| Skewness | 0.045195937 |
| Sum | 3.8861172 × 10⁹ |
| Variance | 249310.53 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 555 | 7991 | 0.3% |
| 557 | 7186 | 0.2% |
| 556 | 7184 | 0.2% |
| 655 | 6662 | 0.2% |
| 558 | 6640 | 0.2% |
| 559 | 6214 | 0.2% |
| 554 | 6114 | 0.2% |
| 656 | 6048 | 0.2% |
| 657 | 5962 | 0.2% |
| 600 | 5812 | 0.2% |
| Other values (1430) | 2856572 | |
| (Missing) | 77615 | 2.6% |
| Value | Count | Frequency (%) |
| 1 | 381 | |
| 2 | 286 | |
| 3 | 222 | |
| 4 | 267 | |
| 5 | 265 | |
| 6 | 233 | |
| 7 | 253 | |
| 8 | 250 | |
| 9 | 223 | |
| 10 | 240 |
| Value | Count | Frequency (%) |
| 2400 | 242 | |
| 2359 | 390 | |
| 2358 | 413 | |
| 2357 | 427 | |
| 2356 | 475 | |
| 2355 | 458 | |
| 2354 | 532 | |
| 2353 | 529 | |
| 2352 | 515 | |
| 2351 | 524 |
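The value ranges of CRS_DEP_TIME and DEP_TIME (1 to 2359 and 1 to 2400, with at most 1440 distinct values) suggest clock times stored as hhmm integers rather than continuous quantities. This is an interpretation, not something the report states. Under that assumption, statistics such as the mean and standard deviation are taken over the encoded integers, so a conversion to minutes after midnight would be needed before doing arithmetic on these columns. A minimal sketch, with a hypothetical function name and with 2400 treated as end of day:

```python
def hhmm_to_minutes(hhmm: int) -> int:
    """Convert an hhmm-encoded clock time (e.g. 1330) to minutes after midnight.

    Assumes hhmm encoding; 2400 is accepted and mapped to 1440 (end of day).
    """
    hours, minutes = divmod(hhmm, 100)
    valid = 0 <= hours <= 24 and 0 <= minutes < 60 and not (hours == 24 and minutes > 0)
    if not valid:
        raise ValueError(f"not a valid hhmm time: {hhmm}")
    return hours * 60 + minutes

# Most common scheduled departure (600) and the two observed maxima.
for value in (600, 2359, 2400):
    print(value, "->", hhmm_to_minutes(value))
```

One consequence of the encoding is that values such as 1275 can never occur, which is why the distinct counts stay at or below 1440 even though the numeric range spans 2399.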
DEP_DELAY
Real number (ℝ)
High correlation Missing Zeros
| Distinct | 1513 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 77644 |
| Missing (%) | 2.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 10.123326 |
| Minimum | -90 |
| Maximum | 2966 |
| Zeros | 141944 |
| Zeros (%) | 4.7% |
| Negative | 1787569 |
| Negative (%) | 59.6% |
| Memory size | 22.9 MiB |
Quantile statistics
| Minimum | -90 |
|---|---|
| 5-th percentile | -10 |
| Q1 | -6 |
| median | -2 |
| Q3 | 6 |
| 95-th percentile | 72 |
| Maximum | 2966 |
| Range | 3056 |
| Interquartile range (IQR) | 12 |
Descriptive statistics
| Standard deviation | 49.251835 |
|---|---|
| Coefficient of variation (CV) | 4.865183 |
| Kurtosis | 243.16697 |
| Mean | 10.123326 |
| Median Absolute Deviation (MAD) | 4 |
| Skewness | 11.474159 |
| Sum | 29583963 |
| Variance | 2425.7432 |
| Monotonicity | Not monotonic |
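For a column with missing values, such as DEP_DELAY, it helps to know which denominator each figure uses. The check below suggests that the mean and sum are computed over non-missing rows only, while the "Negative (%)" figure is taken over all 3000000 rows. This is an inference from the numbers in the tables, not something the report documents.

```python
# Figures copied from the DEP_DELAY tables.
n_rows = 3_000_000
missing = 77_644
mean = 10.123326
negative = 1_787_569

non_missing = n_rows - missing

# Sum recovered from the mean, assuming the mean ignores missing rows.
approx_sum = mean * non_missing

# Share of negative delays (early departures) under both denominators.
negative_pct_all = 100 * negative / n_rows
negative_pct_observed = 100 * negative / non_missing

print(f"sum ≈ {approx_sum:,.0f}")
print(f"negative, all rows:      {negative_pct_all:.1f}%")
print(f"negative, observed rows: {negative_pct_observed:.1f}%")
```

The recovered sum agrees with the reported 29583963 to within rounding of the mean, and the all-rows share reproduces the reported 59.6%. Among flights with a recorded departure delay, the share of early departures is slightly higher.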
| Value | Count | Frequency (%) |
| -5 | 240153 | 8.0% |
| -4 | 223241 | 7.4% |
| -3 | 212648 | 7.1% |
| -2 | 191412 | 6.4% |
| -6 | 186442 | 6.2% |
| -1 | 168704 | 5.6% |
| -7 | 153460 | 5.1% |
| 0 | 141944 | 4.7% |
| -8 | 119359 | 4.0% |
| -9 | 88695 | 3.0% |
| Other values (1503) | 1196298 |
| Value | Count | Frequency (%) |
| -90 | 1 | |
| -89 | 1 | |
| -87 | 1 | |
| -82 | 1 | |
| -74 | 1 | |
| -73 | 1 | |
| -68 | 2 | |
| -66 | 1 | |
| -62 | 2 | |
| -57 | 1 |
| Value | Count | Frequency (%) |
| 2966 | 1 | |
| 2938 | 1 | |
| 2905 | 1 | |
| 2903 | 1 | |
| 2895 | 1 | |
| 2884 | 1 | |
| 2690 | 1 | |
| 2579 | 1 | |
| 2574 | 1 | |
| 2565 | 1 |
TAXI_OUT
Real number (ℝ)
Missing
| Distinct | 179 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 78806 |
| Missing (%) | 2.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 16.643046 |
| Minimum | 1 |
| Maximum | 184 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 22.9 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 8 |
| Q1 | 11 |
| median | 14 |
| Q3 | 19 |
| 95-th percentile | 33 |
| Maximum | 184 |
| Range | 183 |
| Interquartile range (IQR) | 8 |
Descriptive statistics
| Standard deviation | 9.1929012 |
|---|---|
| Coefficient of variation (CV) | 0.55235691 |
| Kurtosis | 23.34913 |
| Mean | 16.643046 |
| Median Absolute Deviation (MAD) | 4 |
| Skewness | 3.4536312 |
| Sum | 48617565 |
| Variance | 84.509433 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 12 | 244243 | 8.1% |
| 11 | 236130 | 7.9% |
| 13 | 233843 | 7.8% |
| 14 | 214014 | 7.1% |
| 10 | 211251 | 7.0% |
| 15 | 191453 | 6.4% |
| 16 | 165427 | 5.5% |
| 9 | 162480 | 5.4% |
| 17 | 142434 | 4.7% |
| 18 | 121273 | 4.0% |
| Other values (169) | 998646 |
| Value | Count | Frequency (%) |
| 1 | 73 | < 0.1% |
| 2 | 119 | < 0.1% |
| 3 | 541 | < 0.1% |
| 4 | 2130 | 0.1% |
| 5 | 6925 | 0.2% |
| 6 | 23916 | 0.8% |
| 7 | 57235 | 1.9% |
| 8 | 106409 | 3.5% |
| 9 | 162480 | 5.4% |
| 10 | 211251 | 7.0% |
| Value | Count | Frequency (%) |
| 184 | 1 | < 0.1% |
| 182 | 1 | < 0.1% |
| 181 | 1 | < 0.1% |
| 177 | 1 | < 0.1% |
| 176 | 2 | < 0.1% |
| 175 | 2 | < 0.1% |
| 174 | 2 | < 0.1% |
| 172 | 2 | < 0.1% |
| 171 | 3 | < 0.1% |
| 170 | 2 | < 0.1% |
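The derived TAXI_OUT statistics follow directly from the basic ones in the tables above. The sketch below recomputes a few of them as a sanity check. The variable names and the `frequency_pct` helper are illustrative, not part of the profiler, and it assumes frequencies are shares of all 3000000 rows, which is consistent with the common-values table.

```python
# Values copied from the TAXI_OUT tables above.
n_rows = 3_000_000
mean, std = 16.643046, 9.1929012
q1, q3 = 11, 19
minimum, maximum = 1, 184

cv = std / mean                  # coefficient of variation
variance = std ** 2              # variance is the squared standard deviation
iqr = q3 - q1                    # interquartile range
value_range = maximum - minimum  # range


def frequency_pct(count, total=n_rows):
    """Share of all rows (missing rows included) as a percentage."""
    return 100 * count / total


print(f"CV={cv:.6f} variance={variance:.4f} IQR={iqr} range={value_range}")
print(f"share of value 12: {frequency_pct(244243):.1f}%")
```

The results should agree with the reported CV, variance, IQR and range up to rounding of the inputs.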
WHEELS_OFF
Real number (ℝ)
High correlation Missing
| Distinct | 1440 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 78806 |
| Missing (%) | 2.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1352.361 |
| Minimum | 1 |
|---|---|
| Maximum | 2400 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 22.9 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 617 |
| Q1 | 931 |
| median | 1336 |
| Q3 | 1752 |
| 95-th percentile | 2145 |
| Maximum | 2400 |
| Range | 2399 |
| Interquartile range (IQR) | 821 |
Descriptive statistics
| Standard deviation | 500.87269 |
|---|---|
| Coefficient of variation (CV) | 0.37036907 |
| Kurtosis | -0.9048526 |
| Mean | 1352.361 |
| Median Absolute Deviation (MAD) | 411 |
| Skewness | 0.011307027 |
| Sum | 3.9505088 × 10⁹ |
| Variance | 250873.45 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 608 | 5094 | 0.2% |
| 609 | 5001 | 0.2% |
| 610 | 4889 | 0.2% |
| 611 | 4824 | 0.2% |
| 612 | 4713 | 0.2% |
| 607 | 4663 | 0.2% |
| 613 | 4537 | 0.2% |
| 709 | 4453 | 0.1% |
| 710 | 4405 | 0.1% |
| 708 | 4401 | 0.1% |
| Other values (1430) | 2874214 | 95.8% |
| (Missing) | 78806 | 2.6% |
| Value | Count | Frequency (%) |
| 1 | 599 | < 0.1% |
| 2 | 453 | < 0.1% |
| 3 | 434 | < 0.1% |
| 4 | 494 | < 0.1% |
| 5 | 459 | < 0.1% |
| 6 | 445 | < 0.1% |
| 7 | 481 | < 0.1% |
| 8 | 453 | < 0.1% |
| 9 | 472 | < 0.1% |
| 10 | 449 | < 0.1% |
| Value | Count | Frequency (%) |
| 2400 | 401 | < 0.1% |
| 2359 | 488 | < 0.1% |
| 2358 | 441 | < 0.1% |
| 2357 | 473 | < 0.1% |
| 2356 | 457 | < 0.1% |
| 2355 | 457 | < 0.1% |
| 2354 | 458 | < 0.1% |
| 2353 | 440 | < 0.1% |
| 2352 | 442 | < 0.1% |
| 2351 | 463 | < 0.1% |
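WHEELS_OFF has 1440 distinct values between 1 and 2400, which is what an HHMM clock reading would produce (one value per minute of the day). Under that assumption, which the report itself does not state, the mean and standard deviation above are computed on the raw HHMM numbers and are not clock-time averages. A minimal sketch of a conversion to minutes since midnight follows. `hhmm_to_minutes` is a hypothetical helper, and folding 2400 onto 0 is also an assumption.

```python
def hhmm_to_minutes(value):
    """Convert an HHMM clock reading (e.g. 608 for 06:08) to minutes since midnight.

    Assumption: 2400 denotes midnight and is folded onto 0.
    """
    hours, minutes = divmod(int(value), 100)
    if hours == 24 and minutes == 0:
        return 0
    if not (0 <= hours < 24 and 0 <= minutes < 60):
        raise ValueError(f"not a valid HHMM value: {value!r}")
    return hours * 60 + minutes


# The most common WHEELS_OFF value and the two ends of the range.
for raw in (608, 1, 2359, 2400):
    print(raw, hhmm_to_minutes(raw))
```

The same conversion would apply to the other clock-time fields in this report (WHEELS_ON, CRS_ARR_TIME, ARR_TIME) if they use the same encoding.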
WHEELS_ON
Real number (ℝ)
High correlation Missing
| Distinct | 1440 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 79944 |
| Missing (%) | 2.7% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1462.4996 |
| Minimum | 1 |
|---|---|
| Maximum | 2400 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 22.9 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 654 |
| Q1 | 1049 |
| median | 1501 |
| Q3 | 1908 |
| 95-th percentile | 2246 |
| Maximum | 2400 |
| Range | 2399 |
| Interquartile range (IQR) | 859 |
Descriptive statistics
| Standard deviation | 527.23682 |
|---|---|
| Coefficient of variation (CV) | 0.36050391 |
| Kurtosis | -0.43422672 |
| Mean | 1462.4996 |
| Median Absolute Deviation (MAD) | 417 |
| Skewness | -0.31554992 |
| Sum | 4.2705806 × 10⁹ |
| Variance | 277978.66 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1629 | 3330 | 0.1% |
| 1620 | 3268 | 0.1% |
| 1626 | 3267 | 0.1% |
| 1624 | 3260 | 0.1% |
| 1627 | 3260 | 0.1% |
| 1637 | 3238 | 0.1% |
| 1642 | 3226 | 0.1% |
| 1632 | 3223 | 0.1% |
| 1625 | 3206 | 0.1% |
| 1621 | 3193 | 0.1% |
| Other values (1430) | 2887585 | 96.3% |
| (Missing) | 79944 | 2.7% |
| Value | Count | Frequency (%) |
| 1 | 1505 | 0.1% |
| 2 | 1273 | < 0.1% |
| 3 | 1265 | < 0.1% |
| 4 | 1288 | < 0.1% |
| 5 | 1238 | < 0.1% |
| 6 | 1205 | < 0.1% |
| 7 | 1172 | < 0.1% |
| 8 | 1141 | < 0.1% |
| 9 | 1116 | < 0.1% |
| 10 | 1138 | < 0.1% |
| Value | Count | Frequency (%) |
| 2400 | 1109 | < 0.1% |
| 2359 | 1355 | < 0.1% |
| 2358 | 1439 | < 0.1% |
| 2357 | 1434 | < 0.1% |
| 2356 | 1428 | < 0.1% |
| 2355 | 1575 | 0.1% |
| 2354 | 1536 | 0.1% |
| 2353 | 1615 | 0.1% |
| 2352 | 1616 | 0.1% |
| 2351 | 1679 | 0.1% |
TAXI_IN
Real number (ℝ)
High correlation Missing
| Distinct | 202 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 79944 |
| Missing (%) | 2.7% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 7.6789822 |
| Minimum | 1 |
|---|---|
| Maximum | 249 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 22.9 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 3 |
| Q1 | 4 |
| median | 6 |
| Q3 | 9 |
| 95-th percentile | 18 |
| Maximum | 249 |
| Range | 248 |
| Interquartile range (IQR) | 5 |
Descriptive statistics
| Standard deviation | 6.2696393 |
|---|---|
| Coefficient of variation (CV) | 0.81646749 |
| Kurtosis | 59.921478 |
| Mean | 7.6789822 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 5.0793158 |
| Sum | 22423058 |
| Variance | 39.308377 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 4 | 453050 | 15.1% |
| 5 | 422185 | 14.1% |
| 6 | 343577 | 11.5% |
| 3 | 311862 | 10.4% |
| 7 | 275467 | 9.2% |
| 8 | 205931 | 6.9% |
| 9 | 158342 | 5.3% |
| 10 | 123961 | 4.1% |
| 11 | 96043 | 3.2% |
| 2 | 91993 | 3.1% |
| Other values (192) | 437645 | 14.6% |
| (Missing) | 79944 | 2.7% |
| Value | Count | Frequency (%) |
| 1 | 5474 | 0.2% |
| 2 | 91993 | 3.1% |
| 3 | 311862 | 10.4% |
| 4 | 453050 | 15.1% |
| 5 | 422185 | 14.1% |
| 6 | 343577 | 11.5% |
| 7 | 275467 | 9.2% |
| 8 | 205931 | 6.9% |
| 9 | 158342 | 5.3% |
| 10 | 123961 | 4.1% |
| Value | Count | Frequency (%) |
| 249 | 1 | < 0.1% |
| 244 | 1 | < 0.1% |
| 240 | 1 | < 0.1% |
| 235 | 2 | < 0.1% |
| 233 | 1 | < 0.1% |
| 232 | 1 | < 0.1% |
| 229 | 1 | < 0.1% |
| 225 | 1 | < 0.1% |
| 222 | 1 | < 0.1% |
| 217 | 1 | < 0.1% |
CRS_ARR_TIME
Real number (ℝ)
High correlation
| Distinct | 1435 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1490.5607 |
| Minimum | 1 |
|---|---|
| Maximum | 2400 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 22.9 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 725 |
| Q1 | 1107 |
| median | 1516 |
| Q3 | 1919 |
| 95-th percentile | 2254 |
| Maximum | 2400 |
| Range | 2399 |
| Interquartile range (IQR) | 812 |
Descriptive statistics
| Standard deviation | 511.54757 |
|---|---|
| Coefficient of variation (CV) | 0.34319138 |
| Kurtosis | -0.4738738 |
| Mean | 1490.5607 |
| Median Absolute Deviation (MAD) | 406 |
| Skewness | -0.27594661 |
| Sum | 4.471682 × 10⁹ |
| Variance | 261680.91 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2359 | 9684 | 0.3% |
| 1900 | 9065 | 0.3% |
| 950 | 8220 | 0.3% |
| 2100 | 8166 | 0.3% |
| 1810 | 7642 | 0.3% |
| 1000 | 7619 | 0.3% |
| 1710 | 7596 | 0.3% |
| 1400 | 7579 | 0.3% |
| 1845 | 7579 | 0.3% |
| 2030 | 7520 | 0.3% |
| Other values (1425) | 2919330 | 97.3% |
| Value | Count | Frequency (%) |
| 1 | 810 | < 0.1% |
| 2 | 649 | < 0.1% |
| 3 | 651 | < 0.1% |
| 4 | 630 | < 0.1% |
| 5 | 2628 | 0.1% |
| 6 | 664 | < 0.1% |
| 7 | 643 | < 0.1% |
| 8 | 527 | < 0.1% |
| 9 | 632 | < 0.1% |
| 10 | 2075 | 0.1% |
| Value | Count | Frequency (%) |
| 2400 | 14 | < 0.1% |
| 2359 | 9684 | 0.3% |
| 2358 | 2950 | 0.1% |
| 2357 | 2664 | 0.1% |
| 2356 | 2403 | 0.1% |
| 2355 | 5395 | 0.2% |
| 2354 | 2033 | 0.1% |
| 2353 | 1954 | 0.1% |
| 2352 | 1786 | 0.1% |
| 2351 | 1675 | 0.1% |
ARR_TIME
Real number (ℝ)
High correlation Missing
| Distinct | 1440 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 79942 |
| Missing (%) | 2.7% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1466.5112 |
| Minimum | 1 |
|---|---|
| Maximum | 2400 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 22.9 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 653 |
| Q1 | 1053 |
| median | 1505 |
| Q3 | 1913 |
| 95-th percentile | 2249 |
| Maximum | 2400 |
| Range | 2399 |
| Interquartile range (IQR) | 860 |
Descriptive statistics
| Standard deviation | 531.83835 |
|---|---|
| Coefficient of variation (CV) | 0.36265551 |
| Kurtosis | -0.35873693 |
| Mean | 1466.5112 |
| Median Absolute Deviation (MAD) | 415 |
| Skewness | -0.35542043 |
| Sum | 4.2822977 × 10⁹ |
| Variance | 282852.03 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1628 | 3246 | 0.1% |
| 1620 | 3233 | 0.1% |
| 1645 | 3232 | 0.1% |
| 1631 | 3232 | 0.1% |
| 1625 | 3229 | 0.1% |
| 1633 | 3223 | 0.1% |
| 1635 | 3220 | 0.1% |
| 1632 | 3183 | 0.1% |
| 1629 | 3181 | 0.1% |
| 1624 | 3174 | 0.1% |
| Other values (1430) | 2887905 | 96.3% |
| (Missing) | 79942 | 2.7% |
| Value | Count | Frequency (%) |
| 1 | 1700 | 0.1% |
| 2 | 1513 | 0.1% |
| 3 | 1384 | < 0.1% |
| 4 | 1400 | < 0.1% |
| 5 | 1474 | < 0.1% |
| 6 | 1324 | < 0.1% |
| 7 | 1420 | < 0.1% |
| 8 | 1312 | < 0.1% |
| 9 | 1271 | < 0.1% |
| 10 | 1292 | < 0.1% |
| Value | Count | Frequency (%) |
| 2400 | 1416 | < 0.1% |
| 2359 | 1570 | 0.1% |
| 2358 | 1608 | 0.1% |
| 2357 | 1763 | 0.1% |
| 2356 | 1750 | 0.1% |
| 2355 | 1749 | 0.1% |
| 2354 | 1843 | 0.1% |
| 2353 | 1780 | 0.1% |
| 2352 | 1840 | 0.1% |
| 2351 | 1912 | 0.1% |
ARR_DELAY
Real number (ℝ)
High correlation Missing Zeros
| Distinct | 1527 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 86198 |
| Missing (%) | 2.9% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4.2608582 |
| Minimum | -96 |
|---|---|
| Maximum | 2934 |
| Zeros | 53685 |
| Zeros (%) | 1.8% |
| Negative | 1881970 |
| Negative (%) | 62.7% |
| Memory size | 22.9 MiB |
Quantile statistics
| Minimum | -96 |
|---|---|
| 5-th percentile | -27 |
| Q1 | -16 |
| median | -7 |
| Q3 | 7 |
| 95-th percentile | 71 |
| Maximum | 2934 |
| Range | 3030 |
| Interquartile range (IQR) | 23 |
Descriptive statistics
| Standard deviation | 51.174824 |
|---|---|
| Coefficient of variation (CV) | 12.01045 |
| Kurtosis | 209.02717 |
| Mean | 4.2608582 |
| Median Absolute Deviation (MAD) | 10 |
| Skewness | 10.293493 |
| Sum | 12415297 |
| Variance | 2618.8626 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| -11 | 86647 | 2.9% |
| -12 | 86584 | 2.9% |
| -13 | 85598 | 2.9% |
| -10 | 85400 | 2.8% |
| -9 | 84583 | 2.8% |
| -14 | 82794 | 2.8% |
| -8 | 81987 | 2.7% |
| -15 | 79719 | 2.7% |
| -7 | 79462 | 2.6% |
| -16 | 76738 | 2.6% |
| Other values (1517) | 2084290 | 69.5% |
| (Missing) | 86198 | 2.9% |
| Value | Count | Frequency (%) |
| -96 | 3 | < 0.1% |
| -95 | 1 | < 0.1% |
| -88 | 1 | < 0.1% |
| -86 | 2 | < 0.1% |
| -85 | 1 | < 0.1% |
| -84 | 2 | < 0.1% |
| -83 | 1 | < 0.1% |
| -82 | 3 | < 0.1% |
| -81 | 3 | < 0.1% |
| -80 | 3 | < 0.1% |
| Value | Count | Frequency (%) |
| 2934 | 1 | < 0.1% |
| 2913 | 1 | < 0.1% |
| 2912 | 1 | < 0.1% |
| 2911 | 1 | < 0.1% |
| 2900 | 2 | < 0.1% |
| 2685 | 1 | < 0.1% |
| 2568 | 1 | < 0.1% |
| 2565 | 1 | < 0.1% |
| 2560 | 1 | < 0.1% |
| 2556 | 1 | < 0.1% |
CANCELLED
Categorical
High correlation Imbalance
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 171.7 MiB |
| 0.0 | 2920860 |
|---|---|
| 1.0 | 79140 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0.0 |
|---|---|
| 2nd row | 0.0 |
| 3rd row | 0.0 |
| 4th row | 0.0 |
| 5th row | 0.0 |
Common Values
| Value | Count | Frequency (%) |
| 0.0 | 2920860 | 97.4% |
| 1.0 | 79140 | 2.6% |
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0.0 | 2920860 | 97.4% |
| 1.0 | 79140 | 2.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 5920860 | 65.8% |
| . | 3000000 | 33.3% |
| 1 | 79140 | 0.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 9000000 | 100.0% |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 5920860 | 65.8% |
| . | 3000000 | 33.3% |
| 1 | 79140 | 0.9% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 9000000 | 100.0% |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 5920860 | 65.8% |
| . | 3000000 | 33.3% |
| 1 | 79140 | 0.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 9000000 | 100.0% |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 5920860 | 65.8% |
| . | 3000000 | 33.3% |
| 1 | 79140 | 0.9% |
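Every CANCELLED value is three characters long ("0.0" or "1.0"), which suggests the flag was profiled as float-formatted text rather than as a boolean. The sketch below shows one way to map such strings to booleans and to recompute the cancellation share from the counts above. `parse_flag` is a hypothetical helper, and the assumption that only these two spellings occur is taken from the Distinct count of 2.

```python
def parse_flag(raw):
    """Map the float-formatted strings '0.0' / '1.0' to False / True."""
    value = float(raw)
    if value not in (0.0, 1.0):
        raise ValueError(f"unexpected flag value: {raw!r}")
    return value == 1.0


# Counts copied from the Common Values table above.
cancelled_count = 79140
total_rows = 3_000_000
cancelled_share = cancelled_count / total_rows

print(parse_flag("1.0"), parse_flag("0.0"))
print(f"cancelled: {100 * cancelled_share:.1f}% of rows")
```

The same mapping would fit DIVERTED, which is reported with the same two string values.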
CANCELLATION_CODE
Categorical
High correlation Missing
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 2920860 |
| Missing (%) | 97.4% |
| Memory size | 182.7 MiB |
| B | 28772 |
|---|---|
| D | 24417 |
| A | 19476 |
| C | 6475 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | D |
|---|---|
| 2nd row | B |
| 3rd row | D |
| 4th row | A |
| 5th row | D |
Common Values
| Value | Count | Frequency (%) |
| B | 28772 | 1.0% |
| D | 24417 | 0.8% |
| A | 19476 | 0.6% |
| C | 6475 | 0.2% |
| (Missing) | 2920860 | 97.4% |
Common Values (Plot)
| Value | Count | Frequency (%) |
| b | 28772 | 36.4% |
| d | 24417 | 30.9% |
| a | 19476 | 24.6% |
| c | 6475 | 8.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| B | 28772 | 36.4% |
| D | 24417 | 30.9% |
| A | 19476 | 24.6% |
| C | 6475 | 8.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 79140 | 100.0% |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| B | 28772 | 36.4% |
| D | 24417 | 30.9% |
| A | 19476 | 24.6% |
| C | 6475 | 8.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 79140 | 100.0% |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| B | 28772 | 36.4% |
| D | 24417 | 30.9% |
| A | 19476 | 24.6% |
| C | 6475 | 8.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 79140 | 100.0% |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| B | 28772 | 36.4% |
| D | 24417 | 30.9% |
| A | 19476 | 24.6% |
| C | 6475 | 8.2% |
DIVERTED
Categorical
High correlation Imbalance
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 171.7 MiB |
| 0.0 | 2992944 |
|---|---|
| 1.0 | 7056 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0.0 |
|---|---|
| 2nd row | 0.0 |
| 3rd row | 0.0 |
| 4th row | 0.0 |
| 5th row | 0.0 |
Common Values
| Value | Count | Frequency (%) |
| 0.0 | 2992944 | 99.8% |
| 1.0 | 7056 | 0.2% |
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0.0 | 2992944 | 99.8% |
| 1.0 | 7056 | 0.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 5992944 | 66.6% |
| . | 3000000 | 33.3% |
| 1 | 7056 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 9000000 | 100.0% |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 5992944 | 66.6% |
| . | 3000000 | 33.3% |
| 1 | 7056 | 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 9000000 | 100.0% |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 5992944 | 66.6% |
| . | 3000000 | 33.3% |
| 1 | 7056 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 9000000 | 100.0% |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 5992944 | 66.6% |
| . | 3000000 | 33.3% |
| 1 | 7056 | 0.1% |
CRS_ELAPSED_TIME
Real number (ℝ)
High correlation
| Distinct | 640 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 14 |
| Missing (%) | < 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 142.27581 |
| Minimum | 1 |
|---|---|
| Maximum | 705 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 22.9 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 63 |
| Q1 | 90 |
| median | 125 |
| Q3 | 172 |
| 95-th percentile | 298 |
| Maximum | 705 |
| Range | 704 |
| Interquartile range (IQR) | 82 |
Descriptive statistics
| Standard deviation | 71.55669 |
|---|---|
| Coefficient of variation (CV) | 0.50294348 |
| Kurtosis | 2.5593398 |
| Mean | 142.27581 |
| Median Absolute Deviation (MAD) | 40 |
| Skewness | 1.4327609 |
| Sum | 4.2682543 × 10⁸ |
| Variance | 5120.3598 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 90 | 57102 | 1.9% |
| 85 | 55629 | 1.9% |
| 80 | 50520 | 1.7% |
| 75 | 46232 | 1.5% |
| 70 | 46057 | 1.5% |
| 95 | 44361 | 1.5% |
| 110 | 39177 | 1.3% |
| 105 | 37458 | 1.2% |
| 115 | 37140 | 1.2% |
| 100 | 36935 | 1.2% |
| Other values (630) | 2549375 | 85.0% |
| Value | Count | Frequency (%) |
| 1 | 1 | < 0.1% |
| 18 | 3 | < 0.1% |
| 20 | 59 | < 0.1% |
| 21 | 28 | < 0.1% |
| 22 | 41 | < 0.1% |
| 23 | 48 | < 0.1% |
| 24 | 68 | < 0.1% |
| 25 | 44 | < 0.1% |
| 26 | 28 | < 0.1% |
| 27 | 9 | < 0.1% |
| Value | Count | Frequency (%) |
| 705 | 1 | < 0.1% |
| 697 | 1 | < 0.1% |
| 695 | 3 | < 0.1% |
| 690 | 9 | < 0.1% |
| 685 | 38 | < 0.1% |
| 684 | 2 | < 0.1% |
| 682 | 4 | < 0.1% |
| 681 | 1 | < 0.1% |
| 680 | 30 | < 0.1% |
| 679 | 4 | < 0.1% |
ELAPSED_TIME
Real number (ℝ)
High correlation Missing
| Distinct | 696 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 86198 |
| Missing (%) | 2.9% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 136.62054 |
| Minimum | 15 |
|---|---|
| Maximum | 739 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 22.9 MiB |
Quantile statistics
| Minimum | 15 |
|---|---|
| 5-th percentile | 56 |
| Q1 | 84 |
| median | 120 |
| Q3 | 167 |
| 95-th percentile | 291 |
| Maximum | 739 |
| Range | 724 |
| Interquartile range (IQR) | 83 |
Descriptive statistics
| Standard deviation | 71.675816 |
|---|---|
| Coefficient of variation (CV) | 0.52463425 |
| Kurtosis | 2.497195 |
| Mean | 136.62054 |
| Median Absolute Deviation (MAD) | 40 |
| Skewness | 1.4078756 |
| Sum | 3.9808521 × 10⁸ |
| Variance | 5137.4225 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 79 | 23941 | 0.8% |
| 80 | 23458 | 0.8% |
| 83 | 23417 | 0.8% |
| 81 | 23354 | 0.8% |
| 82 | 23307 | 0.8% |
| 77 | 23280 | 0.8% |
| 78 | 23128 | 0.8% |
| 76 | 22872 | 0.8% |
| 84 | 22793 | 0.8% |
| 85 | 22749 | 0.8% |
| Other values (686) | 2681503 | 89.4% |
| (Missing) | 86198 | 2.9% |
| Value | Count | Frequency (%) |
| 15 | 3 | < 0.1% |
| 16 | 6 | < 0.1% |
| 17 | 18 | < 0.1% |
| 18 | 24 | < 0.1% |
| 19 | 24 | < 0.1% |
| 20 | 27 | < 0.1% |
| 21 | 32 | < 0.1% |
| 22 | 36 | < 0.1% |
| 23 | 35 | < 0.1% |
| 24 | 28 | < 0.1% |
| Value | Count | Frequency (%) |
| 739 | 1 | < 0.1% |
| 722 | 1 | < 0.1% |
| 720 | 1 | < 0.1% |
| 719 | 1 | < 0.1% |
| 718 | 1 | < 0.1% |
| 716 | 1 | < 0.1% |
| 714 | 2 | < 0.1% |
| 713 | 1 | < 0.1% |
| 712 | 1 | < 0.1% |
| 710 | 1 | < 0.1% |
AIR_TIME
Real number (ℝ)
High correlation Missing
| Distinct | 666 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 86198 |
| Missing (%) | 2.9% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 112.31084 |
| Minimum | 8 |
|---|---|
| Maximum | 692 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 22.9 MiB |
Quantile statistics
| Minimum | 8 |
|---|---|
| 5-th percentile | 35 |
| Q1 | 61 |
| median | 95 |
| Q3 | 142 |
| 95-th percentile | 265 |
| Maximum | 692 |
| Range | 684 |
| Interquartile range (IQR) | 81 |
Descriptive statistics
| Standard deviation | 69.754843 |
|---|---|
| Coefficient of variation (CV) | 0.62108737 |
| Kurtosis | 2.5697142 |
| Mean | 112.31084 |
| Median Absolute Deviation (MAD) | 38 |
| Skewness | 1.4409594 |
| Sum | 3.2725155 × 10⁸ |
| Variance | 4865.7382 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 62 | 25471 | 0.8% |
| 63 | 25366 | 0.8% |
| 61 | 25171 | 0.8% |
| 64 | 25128 | 0.8% |
| 60 | 25055 | 0.8% |
| 65 | 24784 | 0.8% |
| 59 | 24720 | 0.8% |
| 56 | 24625 | 0.8% |
| 55 | 24619 | 0.8% |
| 57 | 24480 | 0.8% |
| Other values (656) | 2664383 | 88.8% |
| (Missing) | 86198 | 2.9% |
| Value | Count | Frequency (%) |
| 8 | 13 | < 0.1% |
| 9 | 91 | < 0.1% |
| 10 | 101 | < 0.1% |
| 11 | 49 | < 0.1% |
| 12 | 53 | < 0.1% |
| 13 | 91 | < 0.1% |
| 14 | 104 | < 0.1% |
| 15 | 256 | < 0.1% |
| 16 | 567 | < 0.1% |
| 17 | 1004 | < 0.1% |
| Value | Count | Frequency (%) |
| 692 | 1 | < 0.1% |
| 690 | 1 | < 0.1% |
| 689 | 1 | < 0.1% |
| 678 | 1 | < 0.1% |
| 677 | 1 | < 0.1% |
| 674 | 2 | < 0.1% |
| 670 | 1 | < 0.1% |
| 669 | 1 | < 0.1% |
| 668 | 1 | < 0.1% |
| 667 | 1 | < 0.1% |
DISTANCE
Real number (ℝ)
High correlation
| Distinct | 1727 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 809.36155 |
| Minimum | 29 |
|---|---|
| Maximum | 5812 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 22.9 MiB |
Quantile statistics
| Minimum | 29 |
|---|---|
| 5-th percentile | 172 |
| Q1 | 377 |
| median | 651 |
| Q3 | 1046 |
| 95-th percentile | 2139 |
| Maximum | 5812 |
| Range | 5783 |
| Interquartile range (IQR) | 669 |
Descriptive statistics
| Standard deviation | 587.89394 |
|---|---|
| Coefficient of variation (CV) | 0.72636751 |
| Kurtosis | 2.843949 |
| Mean | 809.36155 |
| Median Absolute Deviation (MAD) | 318 |
| Skewness | 1.4980542 |
| Sum | 2.4280847 × 10⁹ |
| Variance | 345619.28 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 337 | 17618 | 0.6% |
| 296 | 13873 | 0.5% |
| 399 | 13655 | 0.5% |
| 594 | 12180 | 0.4% |
| 224 | 12138 | 0.4% |
| 733 | 11755 | 0.4% |
| 404 | 11635 | 0.4% |
| 862 | 11604 | 0.4% |
| 214 | 11574 | 0.4% |
| 867 | 11198 | 0.4% |
| Other values (1717) | 2872770 | 95.8% |
| Value | Count | Frequency (%) |
| 29 | 2 | < 0.1% |
| 30 | 11 | < 0.1% |
| 31 | 321 | < 0.1% |
| 41 | 93 | < 0.1% |
| 43 | 5 | < 0.1% |
| 45 | 184 | < 0.1% |
| 46 | 2 | < 0.1% |
| 50 | 31 | < 0.1% |
| 54 | 10 | < 0.1% |
| 61 | 92 | < 0.1% |
| Value | Count | Frequency (%) |
| 5812 | 1 | < 0.1% |
| 5095 | 139 | < 0.1% |
| 4983 | 322 | < 0.1% |
| 4962 | 255 | < 0.1% |
| 4904 | 41 | < 0.1% |
| 4817 | 120 | < 0.1% |
| 4757 | 46 | < 0.1% |
| 4678 | 51 | < 0.1% |
| 4502 | 269 | < 0.1% |
| 4475 | 61 | < 0.1% |
DELAY_DUE_CARRIER
Real number (ℝ)
High correlation Missing Zeros
| Distinct | 1291 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 2466137 |
| Missing (%) | 82.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 24.759086 |
| Minimum | 0 |
|---|---|
| Maximum | 2934 |
| Zeros | 236912 |
| Zeros (%) | 7.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 22.9 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 4 |
| Q3 | 23 |
| 95-th percentile | 104 |
| Maximum | 2934 |
| Range | 2934 |
| Interquartile range (IQR) | 23 |
Descriptive statistics
| Standard deviation | 71.771845 |
|---|---|
| Coefficient of variation (CV) | 2.8988083 |
| Kurtosis | 159.72182 |
| Mean | 24.759086 |
| Median Absolute Deviation (MAD) | 4 |
| Skewness | 9.9998181 |
| Sum | 13217960 |
| Variance | 5151.1977 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 236912 | 7.9% |
| 15 | 9102 | 0.3% |
| 1 | 9073 | 0.3% |
| 2 | 8941 | 0.3% |
| 3 | 8670 | 0.3% |
| 6 | 8650 | 0.3% |
| 4 | 8507 | 0.3% |
| 16 | 8193 | 0.3% |
| 5 | 8166 | 0.3% |
| 7 | 8114 | 0.3% |
| Other values (1281) | 219535 | 7.3% |
| (Missing) | 2466137 | 82.2% |
| Value | Count | Frequency (%) |
| 0 | 236912 | 7.9% |
| 1 | 9073 | 0.3% |
| 2 | 8941 | 0.3% |
| 3 | 8670 | 0.3% |
| 4 | 8507 | 0.3% |
| 5 | 8166 | 0.3% |
| 6 | 8650 | 0.3% |
| 7 | 8114 | 0.3% |
| 8 | 7653 | 0.3% |
| 9 | 7237 | 0.2% |
| Value | Count | Frequency (%) |
| 2934 | 1 | < 0.1% |
| 2913 | 1 | < 0.1% |
| 2903 | 1 | < 0.1% |
| 2884 | 1 | < 0.1% |
| 2685 | 1 | < 0.1% |
| 2565 | 1 | < 0.1% |
| 2560 | 1 | < 0.1% |
| 2556 | 1 | < 0.1% |
| 2522 | 1 | < 0.1% |
| 2308 | 1 | < 0.1% |
DELAY_DUE_WEATHER
Real number (ℝ)
High correlation Missing Zeros
| Distinct | 812 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 2466137 |
| Missing (%) | 82.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.9852603 |
| Minimum | 0 |
|---|---|
| Maximum | 1653 |
| Zeros | 502435 |
| Zeros (%) | 16.7% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 22.9 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 9 |
| Maximum | 1653 |
| Range | 1653 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 32.410796 |
|---|---|
| Coefficient of variation (CV) | 8.1326673 |
| Kurtosis | 516.40848 |
| Mean | 3.9852603 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 19.223965 |
| Sum | 2127583 |
| Variance | 1050.4597 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 502435 | 16.7% |
| 15 | 678 | < 0.1% |
| 8 | 613 | < 0.1% |
| 17 | 596 | < 0.1% |
| 16 | 595 | < 0.1% |
| 7 | 580 | < 0.1% |
| 19 | 567 | < 0.1% |
| 1 | 566 | < 0.1% |
| 2 | 566 | < 0.1% |
| 3 | 556 | < 0.1% |
| Other values (802) | 26111 | 0.9% |
| (Missing) | 2466137 | 82.2% |
| Value | Count | Frequency (%) |
| 0 | 502435 | 16.7% |
| 1 | 566 | < 0.1% |
| 2 | 566 | < 0.1% |
| 3 | 556 | < 0.1% |
| 4 | 503 | < 0.1% |
| 5 | 545 | < 0.1% |
| 6 | 550 | < 0.1% |
| 7 | 580 | < 0.1% |
| 8 | 613 | < 0.1% |
| 9 | 552 | < 0.1% |
| Value | Count | Frequency (%) |
| 1653 | 1 | < 0.1% |
| 1486 | 1 | < 0.1% |
| 1459 | 1 | < 0.1% |
| 1439 | 1 | < 0.1% |
| 1416 | 1 | < 0.1% |
| 1398 | 1 | < 0.1% |
| 1389 | 2 | < 0.1% |
| 1332 | 1 | < 0.1% |
| 1326 | 1 | < 0.1% |
| 1324 | 1 | < 0.1% |
DELAY_DUE_NAS
Real number (ℝ)
High correlation Missing Zeros
| Distinct | 671 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 2466137 |
| Missing (%) | 82.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 13.164728 |
| Minimum | 0 |
|---|---|
| Maximum | 1741 |
| Zeros | 277386 |
| Zeros (%) | 9.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 22.9 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 17 |
| 95-th percentile | 55 |
| Maximum | 1741 |
| Range | 1741 |
| Interquartile range (IQR) | 17 |
Descriptive statistics
| Standard deviation | 33.161122 |
|---|---|
| Coefficient of variation (CV) | 2.5189371 |
| Kurtosis | 282.19185 |
| Mean | 13.164728 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 11.586589 |
| Sum | 7028161 |
| Variance | 1099.66 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 277386 | 9.2% |
| 1 | 12458 | 0.4% |
| 15 | 10410 | 0.3% |
| 2 | 9672 | 0.3% |
| 16 | 9306 | 0.3% |
| 3 | 8959 | 0.3% |
| 4 | 8664 | 0.3% |
| 17 | 8270 | 0.3% |
| 5 | 8172 | 0.3% |
| 18 | 7712 | 0.3% |
| Other values (661) | 172854 | 5.8% |
| (Missing) | 2466137 | 82.2% |
| Value | Count | Frequency (%) |
| 0 | 277386 | 9.2% |
| 1 | 12458 | 0.4% |
| 2 | 9672 | 0.3% |
| 3 | 8959 | 0.3% |
| 4 | 8664 | 0.3% |
| 5 | 8172 | 0.3% |
| 6 | 7513 | 0.3% |
| 7 | 7052 | 0.2% |
| 8 | 6908 | 0.2% |
| 9 | 6373 | 0.2% |
| Value | Count | Frequency (%) |
| 1741 | 1 | < 0.1% |
| 1711 | 1 | < 0.1% |
| 1468 | 1 | < 0.1% |
| 1441 | 1 | < 0.1% |
| 1403 | 1 | < 0.1% |
| 1343 | 1 | < 0.1% |
| 1306 | 1 | < 0.1% |
| 1294 | 1 | < 0.1% |
| 1283 | 1 | < 0.1% |
| 1272 | 1 | < 0.1% |
DELAY_DUE_SECURITY
Real number (ℝ)
High correlation Missing Skewed Zeros
| Distinct | 172 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 2466137 |
| Missing (%) | 82.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.1459307 |
| Minimum | 0 |
|---|---|
| Maximum | 1185 |
| Zeros | 531104 |
| Zeros (%) | 17.7% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 22.9 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 1185 |
| Range | 1185 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 3.5820528 |
|---|---|
| Coefficient of variation (CV) | 24.54626 |
| Kurtosis | 24510.749 |
| Mean | 0.1459307 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 101.21755 |
| Sum | 77907 |
| Variance | 12.831102 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 531104 | 17.7% |
| 15 | 106 | < 0.1% |
| 16 | 104 | < 0.1% |
| 9 | 93 | < 0.1% |
| 8 | 91 | < 0.1% |
| 18 | 89 | < 0.1% |
| 5 | 86 | < 0.1% |
| 6 | 84 | < 0.1% |
| 10 | 81 | < 0.1% |
| 19 | 81 | < 0.1% |
| Other values (162) | 1944 | 0.1% |
| (Missing) | 2466137 | 82.2% |
| Value | Count | Frequency (%) |
| 0 | 531104 | 17.7% |
| 1 | 50 | < 0.1% |
| 2 | 52 | < 0.1% |
| 3 | 63 | < 0.1% |
| 4 | 66 | < 0.1% |
| 5 | 86 | < 0.1% |
| 6 | 84 | < 0.1% |
| 7 | 79 | < 0.1% |
| 8 | 91 | < 0.1% |
| 9 | 93 | < 0.1% |
| Value | Count | Frequency (%) |
| 1185 | 1 | < 0.1% |
| 377 | 1 | < 0.1% |
| 376 | 1 | < 0.1% |
| 366 | 1 | < 0.1% |
| 301 | 1 | < 0.1% |
| 300 | 1 | < 0.1% |
| 291 | 1 | < 0.1% |
| 286 | 1 | < 0.1% |
| 281 | 1 | < 0.1% |
| 280 | 1 | < 0.1% |
DELAY_DUE_LATE_AIRCRAFT
Real number (ℝ)
High correlation Missing Zeros
| Distinct | 958 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 2466137 |
| Missing (%) | 82.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 25.471282 |
| Minimum | 0 |
|---|---|
| Maximum | 2557 |
| Zeros | 274849 |
| Zeros (%) | 9.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 22.9 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 30 |
| 95-th percentile | 117 |
| Maximum | 2557 |
| Range | 2557 |
| Interquartile range (IQR) | 30 |
Descriptive statistics
| Standard deviation | 55.766892 |
|---|---|
| Coefficient of variation (CV) | 2.1894026 |
| Kurtosis | 118.51256 |
| Mean | 25.471282 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 7.3878286 |
| Sum | 13598175 |
| Variance | 3109.9462 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 274849 | 9.2% |
| 15 | 6395 | 0.2% |
| 16 | 5817 | 0.2% |
| 17 | 5603 | 0.2% |
| 18 | 5284 | 0.2% |
| 19 | 5136 | 0.2% |
| 20 | 5017 | 0.2% |
| 21 | 4750 | 0.2% |
| 14 | 4528 | 0.2% |
| 13 | 4365 | 0.1% |
| Other values (948) | 212119 | 7.1% |
| (Missing) | 2466137 | 82.2% |
| Value | Count | Frequency (%) |
| 0 | 274849 | 9.2% |
| 1 | 3827 | 0.1% |
| 2 | 3819 | 0.1% |
| 3 | 3694 | 0.1% |
| 4 | 3545 | 0.1% |
| 5 | 3533 | 0.1% |
| 6 | 3903 | 0.1% |
| 7 | 3874 | 0.1% |
| 8 | 4009 | 0.1% |
| 9 | 4040 | 0.1% |
| Value | Count | Frequency (%) |
| 2557 | 1 | < 0.1% |
| 2096 | 1 | < 0.1% |
| 2010 | 1 | < 0.1% |
| 1905 | 1 | < 0.1% |
| 1872 | 1 | < 0.1% |
| 1802 | 1 | < 0.1% |
| 1736 | 1 | < 0.1% |
| 1722 | 1 | < 0.1% |
| 1715 | 1 | < 0.1% |
| 1669 | 2 | < 0.1% |
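The five DELAY_DUE_* fields share the same missing count (2466137), so their means are averages over the 533863 populated rows only, not over all 3000000 observations. The sketch below checks this by dividing each reported Sum by the populated row count. The dictionary layout and the tolerance are illustrative choices, and reading the means as "per populated row" is an inference from the figures rather than something the report states.

```python
n_rows = 3_000_000
missing = 2_466_137
populated = n_rows - missing  # rows where a delay cause is recorded

# (Sum, Mean) pairs copied from the Descriptive statistics tables above.
reported = {
    "DELAY_DUE_CARRIER": (13217960, 24.759086),
    "DELAY_DUE_WEATHER": (2127583, 3.9852603),
    "DELAY_DUE_NAS": (7028161, 13.164728),
    "DELAY_DUE_SECURITY": (77907, 0.1459307),
    "DELAY_DUE_LATE_AIRCRAFT": (13598175, 25.471282),
}

recomputed = {name: total / populated for name, (total, _) in reported.items()}

for name, (_, mean) in reported.items():
    # Each recomputed mean should match the reported one up to rounding.
    print(f"{name}: reported={mean} recomputed={recomputed[name]:.6f}")
```

Averaged over all rows instead, each mean would be roughly 5.6 times smaller, which matters when comparing these fields with columns that have few missing values.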
Interactions
Correlations
| | AIRLINE | AIRLINE_CODE | AIRLINE_DOT | AIR_TIME | ARR_DELAY | ARR_TIME | CANCELLATION_CODE | CANCELLED | CRS_ARR_TIME | CRS_DEP_TIME | CRS_ELAPSED_TIME | DELAY_DUE_CARRIER | DELAY_DUE_LATE_AIRCRAFT | DELAY_DUE_NAS | DELAY_DUE_SECURITY | DELAY_DUE_WEATHER | DEP_DELAY | DEP_TIME | DISTANCE | DIVERTED | DOT_CODE | ELAPSED_TIME | FL_NUMBER | TAXI_IN | TAXI_OUT | WHEELS_OFF | WHEELS_ON |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| AIRLINE | 1.000 | 1.000 | 1.000 | 0.165 | 0.015 | 0.047 | 0.267 | 0.046 | 0.051 | 0.049 | 0.183 | 0.024 | 0.017 | 0.022 | 0.000 | 0.019 | 0.014 | 0.047 | 0.176 | 0.009 | 1.000 | 0.165 | 0.453 | 0.020 | 0.080 | 0.049 | 0.048 |
| AIRLINE_CODE | 1.000 | 1.000 | 1.000 | 0.165 | 0.015 | 0.047 | 0.267 | 0.046 | 0.051 | 0.049 | 0.183 | 0.024 | 0.017 | 0.022 | 0.000 | 0.019 | 0.014 | 0.047 | 0.176 | 0.009 | 1.000 | 0.165 | 0.453 | 0.020 | 0.080 | 0.049 | 0.048 |
| AIRLINE_DOT | 1.000 | 1.000 | 1.000 | 0.165 | 0.015 | 0.047 | 0.267 | 0.046 | 0.051 | 0.049 | 0.183 | 0.024 | 0.017 | 0.022 | 0.000 | 0.019 | 0.014 | 0.047 | 0.176 | 0.009 | 1.000 | 0.165 | 0.453 | 0.020 | 0.080 | 0.049 | 0.048 |
| AIR_TIME | 0.165 | 0.165 | 0.165 | 1.000 | 0.034 | 0.041 | 0.000 | 1.000 | 0.049 | -0.032 | 0.984 | 0.015 | -0.088 | 0.166 | 0.008 | -0.033 | 0.081 | -0.033 | 0.986 | 1.000 | -0.063 | 0.979 | -0.335 | 0.128 | 0.071 | -0.039 | 0.043 |
| ARR_DELAY | 0.015 | 0.015 | 0.015 | 0.034 | 1.000 | 0.110 | 0.000 | 1.000 | 0.110 | 0.126 | -0.027 | 0.204 | 0.352 | 0.002 | -0.009 | 0.129 | 0.656 | 0.164 | 0.003 | 1.000 | -0.036 | 0.099 | -0.039 | 0.117 | 0.273 | 0.169 | 0.116 |
| ARR_TIME | 0.047 | 0.047 | 0.047 | 0.041 | 0.110 | 1.000 | 0.000 | 1.000 | 0.900 | 0.734 | 0.036 | -0.075 | 0.130 | -0.005 | -0.003 | 0.008 | 0.129 | 0.755 | 0.045 | 0.029 | -0.003 | 0.041 | -0.001 | -0.028 | 0.022 | 0.770 | 0.978 |
| CANCELLATION_CODE | 0.267 | 0.267 | 0.267 | 0.000 | 0.000 | 0.000 | 1.000 | 1.000 | 0.089 | 0.092 | 0.071 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.154 | 0.079 | 1.000 | 0.160 | 0.000 | 0.146 | 0.000 | 0.000 | 0.153 | 0.000 |
| CANCELLED | 0.046 | 0.046 | 0.046 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 0.016 | 0.018 | 0.017 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 0.015 | 0.010 | 0.018 | 0.008 | 0.033 | 1.000 | 0.013 | 1.000 | 0.006 | 0.006 | 1.000 |
| CRS_ARR_TIME | 0.051 | 0.051 | 0.051 | 0.049 | 0.110 | 0.900 | 0.089 | 0.016 | 1.000 | 0.798 | 0.046 | -0.058 | 0.204 | -0.051 | -0.003 | 0.008 | 0.140 | 0.796 | 0.056 | 0.011 | 0.001 | 0.046 | -0.008 | -0.037 | 0.017 | 0.804 | 0.907 |
| CRS_DEP_TIME | 0.049 | 0.049 | 0.049 | -0.032 | 0.126 | 0.734 | 0.092 | 0.018 | 0.798 | 1.000 | -0.038 | -0.040 | 0.250 | -0.101 | -0.005 | 0.004 | 0.153 | 0.970 | -0.027 | 0.010 | 0.007 | -0.035 | -0.001 | -0.068 | -0.002 | 0.955 | 0.753 |
| CRS_ELAPSED_TIME | 0.183 | 0.183 | 0.183 | 0.984 | -0.027 | 0.036 | 0.071 | 0.017 | 0.046 | -0.038 | 1.000 | 0.032 | -0.068 | 0.109 | 0.008 | -0.029 | 0.076 | -0.038 | 0.979 | 0.013 | -0.022 | 0.974 | -0.312 | 0.166 | 0.112 | -0.043 | 0.037 |
| DELAY_DUE_CARRIER | 0.024 | 0.024 | 0.024 | 0.015 | 0.204 | -0.075 | 0.000 | 1.000 | -0.058 | -0.040 | 0.032 | 1.000 | -0.251 | -0.371 | -0.063 | -0.235 | 0.305 | -0.024 | 0.042 | 1.000 | -0.054 | -0.043 | -0.019 | -0.119 | -0.138 | -0.035 | -0.072 |
| DELAY_DUE_LATE_AIRCRAFT | 0.017 | 0.017 | 0.017 | -0.088 | 0.352 | 0.130 | 0.000 | 1.000 | 0.204 | 0.250 | -0.068 | -0.251 | 1.000 | -0.291 | -0.016 | -0.050 | 0.457 | 0.281 | -0.059 | 1.000 | -0.071 | -0.150 | -0.017 | -0.088 | -0.203 | 0.263 | 0.138 |
| DELAY_DUE_NAS | 0.022 | 0.022 | 0.022 | 0.166 | 0.002 | -0.005 | 0.000 | 1.000 | -0.051 | -0.101 | 0.109 | -0.371 | -0.291 | 1.000 | -0.013 | -0.010 | -0.377 | -0.117 | 0.094 | 1.000 | 0.130 | 0.321 | -0.060 | 0.292 | 0.447 | -0.093 | -0.008 |
| DELAY_DUE_SECURITY | 0.000 | 0.000 | 0.000 | 0.008 | -0.009 | -0.003 | 0.000 | 1.000 | -0.003 | -0.005 | 0.008 | -0.063 | -0.016 | -0.013 | 1.000 | -0.017 | 0.001 | -0.005 | 0.011 | 1.000 | 0.011 | 0.003 | -0.019 | -0.003 | -0.014 | -0.006 | -0.003 |
| DELAY_DUE_WEATHER | 0.019 | 0.019 | 0.019 | -0.033 | 0.129 | 0.008 | 0.000 | 1.000 | 0.008 | 0.004 | -0.029 | -0.235 | -0.050 | -0.010 | -0.017 | 1.000 | 0.109 | 0.015 | -0.036 | 1.000 | 0.057 | -0.015 | 0.053 | -0.004 | 0.074 | 0.016 | 0.008 |
| DEP_DELAY | 0.014 | 0.014 | 0.014 | 0.081 | 0.656 | 0.129 | 0.000 | 0.015 | 0.140 | 0.153 | 0.076 | 0.305 | 0.457 | -0.377 | 0.001 | 0.109 | 1.000 | 0.198 | 0.089 | 0.011 | -0.174 | 0.079 | -0.084 | -0.050 | 0.024 | 0.194 | 0.137 |
| DEP_TIME | 0.047 | 0.047 | 0.047 | -0.033 | 0.164 | 0.755 | 0.154 | 0.010 | 0.796 | 0.970 | -0.038 | -0.024 | 0.281 | -0.117 | -0.005 | 0.015 | 0.198 | 1.000 | -0.028 | 0.010 | 0.001 | -0.034 | 0.001 | -0.065 | 0.004 | 0.984 | 0.774 |
| DISTANCE | 0.176 | 0.176 | 0.176 | 0.986 | 0.003 | 0.045 | 0.079 | 0.018 | 0.056 | -0.027 | 0.979 | 0.042 | -0.059 | 0.094 | 0.011 | -0.036 | 0.089 | -0.028 | 1.000 | 0.013 | -0.086 | 0.961 | -0.354 | 0.113 | 0.056 | -0.035 | 0.048 |
| DIVERTED | 0.009 | 0.009 | 0.009 | 1.000 | 1.000 | 0.029 | 1.000 | 0.008 | 0.011 | 0.010 | 0.013 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 0.011 | 0.010 | 0.013 | 1.000 | 0.007 | 1.000 | 0.003 | 0.018 | 0.013 | 0.010 | 0.029 |
| DOT_CODE | 1.000 | 1.000 | 1.000 | -0.063 | -0.036 | -0.003 | 0.160 | 0.033 | 0.001 | 0.007 | -0.022 | -0.054 | -0.071 | 0.130 | 0.011 | 0.057 | -0.174 | 0.001 | -0.086 | 0.007 | 1.000 | -0.006 | 0.338 | 0.214 | 0.276 | 0.005 | -0.005 |
| ELAPSED_TIME | 0.165 | 0.165 | 0.165 | 0.979 | 0.099 | 0.041 | 0.000 | 1.000 | 0.046 | -0.035 | 0.974 | -0.043 | -0.150 | 0.321 | 0.003 | -0.015 | 0.079 | -0.034 | 0.961 | 1.000 | -0.006 | 1.000 | -0.308 | 0.212 | 0.203 | -0.036 | 0.043 |
| FL_NUMBER | 0.453 | 0.453 | 0.453 | -0.335 | -0.039 | -0.001 | 0.146 | 0.013 | -0.008 | -0.001 | -0.312 | -0.019 | -0.017 | -0.060 | -0.019 | 0.053 | -0.084 | 0.001 | -0.354 | 0.003 | 0.338 | -0.308 | 1.000 | -0.023 | 0.086 | 0.008 | -0.005 |
| TAXI_IN | 0.020 | 0.020 | 0.020 | 0.128 | 0.117 | -0.028 | 0.000 | 1.000 | -0.037 | -0.068 | 0.166 | -0.119 | -0.088 | 0.292 | -0.003 | -0.004 | -0.050 | -0.065 | 0.113 | 0.018 | 0.214 | 0.212 | -0.023 | 1.000 | 0.068 | -0.065 | -0.036 |
| TAXI_OUT | 0.080 | 0.080 | 0.080 | 0.071 | 0.273 | 0.022 | 0.000 | 0.006 | 0.017 | -0.002 | 0.112 | -0.138 | -0.203 | 0.447 | -0.014 | 0.074 | 0.024 | 0.004 | 0.056 | 0.013 | 0.276 | 0.203 | 0.086 | 0.068 | 1.000 | 0.024 | 0.025 |
| WHEELS_OFF | 0.049 | 0.049 | 0.049 | -0.039 | 0.169 | 0.770 | 0.153 | 0.006 | 0.804 | 0.955 | -0.043 | -0.035 | 0.263 | -0.093 | -0.006 | 0.016 | 0.194 | 0.984 | -0.035 | 0.010 | 0.005 | -0.036 | 0.008 | -0.065 | 0.024 | 1.000 | 0.789 |
| WHEELS_ON | 0.048 | 0.048 | 0.048 | 0.043 | 0.116 | 0.978 | 0.000 | 1.000 | 0.907 | 0.753 | 0.037 | -0.072 | 0.138 | -0.008 | -0.003 | 0.008 | 0.137 | 0.774 | 0.048 | 0.029 | -0.005 | 0.043 | -0.005 | -0.036 | 0.025 | 0.789 | 1.000 |
Missing values
Sample
First rows
| | FL_DATE | AIRLINE | AIRLINE_DOT | AIRLINE_CODE | DOT_CODE | FL_NUMBER | ORIGIN | ORIGIN_CITY | DEST | DEST_CITY | CRS_DEP_TIME | DEP_TIME | DEP_DELAY | TAXI_OUT | WHEELS_OFF | WHEELS_ON | TAXI_IN | CRS_ARR_TIME | ARR_TIME | ARR_DELAY | CANCELLED | CANCELLATION_CODE | DIVERTED | CRS_ELAPSED_TIME | ELAPSED_TIME | AIR_TIME | DISTANCE | DELAY_DUE_CARRIER | DELAY_DUE_WEATHER | DELAY_DUE_NAS | DELAY_DUE_SECURITY | DELAY_DUE_LATE_AIRCRAFT |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 2019-01-09 | United Air Lines Inc. | United Air Lines Inc.: UA | UA | 19977 | 1562 | FLL | Fort Lauderdale, FL | EWR | Newark, NJ | 1155 | 1151.0 | -4.0 | 19.0 | 1210.0 | 1443.0 | 4.0 | 1501 | 1447.0 | -14.0 | 0.0 | NaN | 0.0 | 186.0 | 176.0 | 153.0 | 1065.0 | NaN | NaN | NaN | NaN | NaN |
| 1 | 2022-11-19 | Delta Air Lines Inc. | Delta Air Lines Inc.: DL | DL | 19790 | 1149 | MSP | Minneapolis, MN | SEA | Seattle, WA | 2120 | 2114.0 | -6.0 | 9.0 | 2123.0 | 2232.0 | 38.0 | 2315 | 2310.0 | -5.0 | 0.0 | NaN | 0.0 | 235.0 | 236.0 | 189.0 | 1399.0 | NaN | NaN | NaN | NaN | NaN |
| 2 | 2022-07-22 | United Air Lines Inc. | United Air Lines Inc.: UA | UA | 19977 | 459 | DEN | Denver, CO | MSP | Minneapolis, MN | 954 | 1000.0 | 6.0 | 20.0 | 1020.0 | 1247.0 | 5.0 | 1252 | 1252.0 | 0.0 | 0.0 | NaN | 0.0 | 118.0 | 112.0 | 87.0 | 680.0 | NaN | NaN | NaN | NaN | NaN |
| 3 | 2023-03-06 | Delta Air Lines Inc. | Delta Air Lines Inc.: DL | DL | 19790 | 2295 | MSP | Minneapolis, MN | SFO | San Francisco, CA | 1609 | 1608.0 | -1.0 | 27.0 | 1635.0 | 1844.0 | 9.0 | 1829 | 1853.0 | 24.0 | 0.0 | NaN | 0.0 | 260.0 | 285.0 | 249.0 | 1589.0 | 0.0 | 0.0 | 24.0 | 0.0 | 0.0 |
| 4 | 2020-02-23 | Spirit Air Lines | Spirit Air Lines: NK | NK | 20416 | 407 | MCO | Orlando, FL | DFW | Dallas/Fort Worth, TX | 1840 | 1838.0 | -2.0 | 15.0 | 1853.0 | 2026.0 | 14.0 | 2041 | 2040.0 | -1.0 | 0.0 | NaN | 0.0 | 181.0 | 182.0 | 153.0 | 985.0 | NaN | NaN | NaN | NaN | NaN |
| 5 | 2019-07-31 | Southwest Airlines Co. | Southwest Airlines Co.: WN | WN | 19393 | 665 | DAL | Dallas, TX | OKC | Oklahoma City, OK | 1010 | 1237.0 | 147.0 | 15.0 | 1252.0 | 1328.0 | 3.0 | 1110 | 1331.0 | 141.0 | 0.0 | NaN | 0.0 | 60.0 | 54.0 | 36.0 | 181.0 | 141.0 | 0.0 | 0.0 | 0.0 | 0.0 |
| 6 | 2023-06-11 | American Airlines Inc. | American Airlines Inc.: AA | AA | 19805 | 2134 | DCA | Washington, DC | BOS | Boston, MA | 1010 | 1001.0 | -9.0 | 23.0 | 1024.0 | 1122.0 | 8.0 | 1159 | 1130.0 | -29.0 | 0.0 | NaN | 0.0 | 109.0 | 89.0 | 58.0 | 399.0 | NaN | NaN | NaN | NaN | NaN |
| 7 | 2019-07-08 | Republic Airline | Republic Airline: YX | YX | 20452 | 4464 | HSV | Huntsville, AL | DCA | Washington, DC | 1643 | 1637.0 | -6.0 | 22.0 | 1659.0 | 1927.0 | 41.0 | 1945 | 2008.0 | 23.0 | 0.0 | NaN | 0.0 | 122.0 | 151.0 | 88.0 | 613.0 | 0.0 | 0.0 | 23.0 | 0.0 | 0.0 |
| 8 | 2023-02-12 | Spirit Air Lines | Spirit Air Lines: NK | NK | 20416 | 590 | IAH | Houston, TX | LAX | Los Angeles, CA | 530 | 527.0 | -3.0 | 11.0 | 538.0 | 658.0 | 8.0 | 717 | 706.0 | -11.0 | 0.0 | NaN | 0.0 | 227.0 | 219.0 | 200.0 | 1379.0 | NaN | NaN | NaN | NaN | NaN |
| 9 | 2020-08-22 | Alaska Airlines Inc. | Alaska Airlines Inc.: AS | AS | 19930 | 223 | SEA | Seattle, WA | FAI | Fairbanks, AK | 2125 | 2116.0 | -9.0 | 19.0 | 2135.0 | 2353.0 | 3.0 | 2355 | 2356.0 | 1.0 | 0.0 | NaN | 0.0 | 210.0 | 220.0 | 198.0 | 1533.0 | NaN | NaN | NaN | NaN | NaN |

Last rows
| | FL_DATE | AIRLINE | AIRLINE_DOT | AIRLINE_CODE | DOT_CODE | FL_NUMBER | ORIGIN | ORIGIN_CITY | DEST | DEST_CITY | CRS_DEP_TIME | DEP_TIME | DEP_DELAY | TAXI_OUT | WHEELS_OFF | WHEELS_ON | TAXI_IN | CRS_ARR_TIME | ARR_TIME | ARR_DELAY | CANCELLED | CANCELLATION_CODE | DIVERTED | CRS_ELAPSED_TIME | ELAPSED_TIME | AIR_TIME | DISTANCE | DELAY_DUE_CARRIER | DELAY_DUE_WEATHER | DELAY_DUE_NAS | DELAY_DUE_SECURITY | DELAY_DUE_LATE_AIRCRAFT |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 2999990 | 2023-07-26 | SkyWest Airlines Inc. | SkyWest Airlines Inc.: OO | OO | 20304 | 4126 | DTW | Detroit, MI | MSN | Madison, WI | 825 | 824.0 | -1.0 | 32.0 | 856.0 | 851.0 | 5.0 | 843 | 856.0 | 13.0 | 0.0 | NaN | 0.0 | 78.0 | 92.0 | 55.0 | 311.0 | NaN | NaN | NaN | NaN | NaN |
| 2999991 | 2021-12-03 | Delta Air Lines Inc. | Delta Air Lines Inc.: DL | DL | 19790 | 1146 | MSP | Minneapolis, MN | SNA | Santa Ana, CA | 1825 | 1833.0 | 8.0 | 23.0 | 1856.0 | 2015.0 | 7.0 | 2030 | 2022.0 | -8.0 | 0.0 | NaN | 0.0 | 245.0 | 229.0 | 199.0 | 1522.0 | NaN | NaN | NaN | NaN | NaN |
| 2999992 | 2019-01-13 | JetBlue Airways | JetBlue Airways: B6 | B6 | 20409 | 1668 | CHS | Charleston, SC | BOS | Boston, MA | 1258 | 1245.0 | -13.0 | 15.0 | 1300.0 | 1438.0 | 4.0 | 1510 | 1442.0 | -28.0 | 0.0 | NaN | 0.0 | 132.0 | 117.0 | 98.0 | 818.0 | NaN | NaN | NaN | NaN | NaN |
| 2999993 | 2019-12-23 | Allegiant Air | Allegiant Air: G4 | G4 | 20368 | 1899 | SRQ | Sarasota/Bradenton, FL | IND | Indianapolis, IN | 907 | 905.0 | -2.0 | 11.0 | 916.0 | 1106.0 | 9.0 | 1125 | 1115.0 | -10.0 | 0.0 | NaN | 0.0 | 138.0 | 130.0 | 110.0 | 876.0 | NaN | NaN | NaN | NaN | NaN |
| 2999994 | 2020-08-31 | Delta Air Lines Inc. | Delta Air Lines Inc.: DL | DL | 19790 | 1408 | FLL | Fort Lauderdale, FL | LGA | New York, NY | 700 | 653.0 | -7.0 | 16.0 | 709.0 | 927.0 | 6.0 | 944 | 933.0 | -11.0 | 0.0 | NaN | 0.0 | 164.0 | 160.0 | 138.0 | 1076.0 | NaN | NaN | NaN | NaN | NaN |
| 2999995 | 2022-11-13 | American Airlines Inc. | American Airlines Inc.: AA | AA | 19805 | 1522 | JAX | Jacksonville, FL | CLT | Charlotte, NC | 1742 | 1740.0 | -2.0 | 10.0 | 1750.0 | 1845.0 | 6.0 | 1907 | 1851.0 | -16.0 | 0.0 | NaN | 0.0 | 85.0 | 71.0 | 55.0 | 328.0 | NaN | NaN | NaN | NaN | NaN |
| 2999996 | 2022-11-02 | American Airlines Inc. | American Airlines Inc.: AA | AA | 19805 | 1535 | ORD | Chicago, IL | AUS | Austin, TX | 1300 | 1254.0 | -6.0 | 10.0 | 1304.0 | 1514.0 | 5.0 | 1556 | 1519.0 | -37.0 | 0.0 | NaN | 0.0 | 176.0 | 145.0 | 130.0 | 977.0 | NaN | NaN | NaN | NaN | NaN |
| 2999997 | 2022-09-11 | Delta Air Lines Inc. | Delta Air Lines Inc.: DL | DL | 19790 | 2745 | HSV | Huntsville, AL | ATL | Atlanta, GA | 534 | 615.0 | 41.0 | 16.0 | 631.0 | 759.0 | 6.0 | 729 | 805.0 | 36.0 | 0.0 | NaN | 0.0 | 55.0 | 50.0 | 28.0 | 151.0 | 0.0 | 36.0 | 0.0 | 0.0 | 0.0 |
| 2999998 | 2019-11-13 | Republic Airline | Republic Airline: YX | YX | 20452 | 6134 | BOS | Boston, MA | LGA | New York, NY | 1600 | 1555.0 | -5.0 | 19.0 | 1614.0 | 1704.0 | 8.0 | 1728 | 1712.0 | -16.0 | 0.0 | NaN | 0.0 | 88.0 | 77.0 | 50.0 | 184.0 | NaN | NaN | NaN | NaN | NaN |
| 2999999 | 2019-06-15 | Southwest Airlines Co. | Southwest Airlines Co.: WN | WN | 19393 | 2823 | LGB | Long Beach, CA | SJC | San Jose, CA | 730 | 727.0 | -3.0 | 9.0 | 736.0 | 828.0 | 2.0 | 840 | 830.0 | -10.0 | 0.0 | NaN | 0.0 | 70.0 | 63.0 | 52.0 | 324.0 | NaN | NaN | NaN | NaN | NaN |
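
The sample rows suggest two relationships between columns that are worth checking before modelling. The sketch below is an illustration only, not part of the generated report. It assumes that the clock columns (CRS_DEP_TIME, DEP_TIME, and so on) are encoded as HHMM values, for example 1155 for 11:55, and that ELAPSED_TIME is the sum of TAXI_OUT, AIR_TIME, and TAXI_IN. The helper names `hhmm_to_minutes`, `clock_delay`, and `row_is_consistent` are hypothetical, and the three test rows are copied from the sample (indices 0, 5, and 2999997).

```python
def hhmm_to_minutes(hhmm):
    """Convert an HHMM clock value (e.g. 1155 or 1151.0) to minutes after midnight."""
    hours, minutes = divmod(int(hhmm), 100)
    return (hours % 24) * 60 + minutes  # a value of 2400 maps to 0


def clock_delay(scheduled_hhmm, actual_hhmm):
    """Signed difference in minutes between two HHMM values.

    The result is wrapped into [-720, 720) so that a departure just after
    midnight is not read as almost a day early. Assumption: the true delay
    is shorter than 12 hours in either direction.
    """
    diff = hhmm_to_minutes(actual_hhmm) - hhmm_to_minutes(scheduled_hhmm)
    return (diff + 720) % 1440 - 720


def row_is_consistent(row):
    """Check both assumed relationships for one record (a plain dict)."""
    delay_ok = clock_delay(row["CRS_DEP_TIME"], row["DEP_TIME"]) == row["DEP_DELAY"]
    elapsed_ok = (
        row["TAXI_OUT"] + row["AIR_TIME"] + row["TAXI_IN"] == row["ELAPSED_TIME"]
    )
    return delay_ok and elapsed_ok


# Values copied from the sample rows with indices 0, 5, and 2999997.
rows = [
    {"CRS_DEP_TIME": 1155, "DEP_TIME": 1151.0, "DEP_DELAY": -4.0,
     "TAXI_OUT": 19.0, "AIR_TIME": 153.0, "TAXI_IN": 4.0, "ELAPSED_TIME": 176.0},
    {"CRS_DEP_TIME": 1010, "DEP_TIME": 1237.0, "DEP_DELAY": 147.0,
     "TAXI_OUT": 15.0, "AIR_TIME": 36.0, "TAXI_IN": 3.0, "ELAPSED_TIME": 54.0},
    {"CRS_DEP_TIME": 534, "DEP_TIME": 615.0, "DEP_DELAY": 41.0,
     "TAXI_OUT": 16.0, "AIR_TIME": 28.0, "TAXI_IN": 6.0, "ELAPSED_TIME": 50.0},
]

results = [row_is_consistent(row) for row in rows]
```

Records with missing values (such as cancelled flights, where DEP_TIME is NaN) would need to be filtered out before applying these checks, since `int()` cannot convert NaN.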